Exact Dependence Analysis for Increased Communication Overlap
نویسندگان
چکیده
MPI programs are often challenged to scale up to several million cores. In doing so, the programmer tunes every aspect of the application code. However, for large applications, this is often not practical and expensive tracing tools and post-mortem analysis are employed to guide the tuning efforts finding hot-spots and performance bottlenecks. In this paper we revive the use of compiler analysis techniques to automatically unveil opportunities for communication/computation overlap using the result of exact data dependence analysis provided by the polyhedral model. We apply our technique to a 5-point stencil code showing performance improvements up to 28% using 512 cores.
منابع مشابه
Asymptotic Analysis of Binary Gas Mixture Separation by Nanometric Tubular Ceramic Membranes: Cocurrent and Countercurrent Flow Patterns
Analytical gas-permeation models for predicting the separation process across membranes (exit compositions and area requirement) constitutes an important and necessary step in understanding the overall performance of membrane modules. But, the exact (numerical) solution methods suffer from the complexity of the solution. Therefore, solutions of nonlinear ordinary differential equations th...
متن کاملRelationship between dependence on modern communication tools and psychological well-being in students
Nowadays due to technological advance and its direct impact on the quality of human life, addressing the issue of health and its determinants is very important. Maintaining public health is one of the most fundamental issues in each community. The aim of the study was to survey relationship between dependence on modern communication tools (internet and mobile phones) and psychological well-bein...
متن کاملExact and Efficient Computation of the Expected Number of Missing and Common Words in Random Texts
The number of missing words (NMW) of length q in a text, and the number of common words (NCW) of two texts are useful text statistics. Knowing the distribution of the NMW in a random text is essential for the construction of so-called monkey tests for pseudorandom number generators. Knowledge of the distribution of the NCW of two independent random texts is useful for the average case analysis ...
متن کاملToucan - A Translator for Communication Tolerant MPI Applications
We discuss early results with Toucan, a sourceto-source translator that automatically restructures C/C++ MPI applications to overlap communication with computation. We co-designed the translator and runtime system to enable dynamic, dependence-driven execution of MPI applications, and require only a modest amount of programmer annotation. Co-design was essential to realizing overlap through dyn...
متن کاملRecent Results Using the Overlap Dirac Operator
The overlap Dirac operator, derived from the overlap formalism for the special case of vector gauge theories, is a way to realize exact chiral symmetry on the lattice. Exact chiral symmetry on the lattice does come at a price – numerical implementation of the overlap Dirac operator is significantly more expensive than Wilson or staggered operator. In spite of this numerical hurdle, we already h...
متن کامل